Description

Data Science is a dynamic field of Statistics and Computer Science. The emergence of large datasets with billions of observations makes primary impetus for the field. Such datasets come in astronomy, large-scale retailing, telecommunications, and social media. This course is designed to emphasize on practical techniques of working with large data. Topics covered include statistical modelling, data pipelines, machine learning, programming languages, real-world topics, big data tools, and case studies.

This is an instructor-led course with an average batch size of 4 students. In the 60 hours of Online Live training, you will get both the theoretical and practical knowledge needed to build the necessary skills. The trainer’s holistic approach is stemmed to meet the long-term needs of the student and hence they provide 100% job/placement assistance with the option of seeking a trial class before the enrolment.

What Will I Learn?

  • You will learn Statistics, Probability theory and Hypothesis testing,
  • You will learn programming languages like Python and R programming.
  • You will learn machine learning and deep learning concepts.

Specifications

  • Free Demo
  • Learn from Experts
  • Interactive Learning
  • Missed Class Recovery
  • Instalment Facility

Introduction  To Data Science

  • The life cycle of Data Science

Statistics

  • Statistical Learning 
  • Measures of central tendency
  • Measures of dispersion
  • Probability theory
  • Hypothesis testing,
  • ANOVA
  • Types of graphs and
  • plots

R Programming 

  • R Environment Setup and Essentials
  • Installing R for the Windows, Linux and Mac
  • Exploratory data analysis
  • Basic operators in R
  • Data Manipulation
  • Data visualisation
  • Followed byHands-on Exercise

Python

  • Python language Basic
  • Constructs
  • OOP concepts in Python
  • Hands-on Exercise – important concepts in OOP like polymorphism, inheritance, encapsulation, Python functions, return types, and parameters, Lambda expressions
  • NumPy for mathematical computing
  • Hands-on Exercise – How to import NumPy module, creating an array using ND-array
  • calculating standard deviation on an array of numbers, calculating the correlation between two variables.
  • SciPy for scientific computing
  • Hands-on Exercise – Importing of SciPy, applying the Bayes theorem on the given dataset.
  • Matplotlib for data visualization
  • Hands-on Exercise – deploying MatPlotLib for creating Pie
  • Scatter, Line, Histogram.
  • Pandas for data analysis and machine learning
  • Hands-on Exercise – working on importing data files, selecting record by a group, applying a filter on top, viewing records, analyzing with linear regression, and creation of time series.
  • Python Environment Setup and Essentials Installing Python Anaconda for the Windows, Linux and Mac with Hands-on Exercise

Machine Learning

  • Introduction to Machine
  • Learning with R and
  • Python
  • The need for Machine Learning,
  • Introduction to Machine
  • Learning, types of Machine
  • Learning, such as supervised
  • unsupervised and reinforcement learning, why Machine Learning with Python, R and applications of Machine Learning.
  • Supervised Learning and
  • Linear Regression
  • Hands-on Exercise – Implementing linear regression from scratch with R and Python, Using Python library Scikit-learn to perform simple linear regression and multiple linear regression, Implementing train– test split and predicting the values on the test set.
  • Classification and Logistic Regression
  • Hands-on Exercise – Implementing logistic regression from scratch with R and Python, Using Python library Scikit-learn to perform simple logistic regression and multiple logistic regression, Building a confusion matrix to find out the accuracy, true positive rate, and false-positive rate.
  • Decision Tree and Random Forest
  • Hands-on Exercise – Implementing a decision tree from scratch in R and Python, Using Python library Scikit-learn to build a decision tree and a random forest, Visualizing the tree and changing the hyperparameters in the random forest.
  • Naïve Bayes and Support Vector Machine
  • Hands-on Exercise – Using Python library Scikit-learn to build a Naïve Bayes classifier and a support vector classifier.
  • Unsupervised Learning
  • Hands-on Exercise – Using Python library Scikit-learn to implement K-means clustering, Implementing PCA (principal component analysis) on top of a dataset.

Deep Learning As Part AI

  • Natural Language Processing and Text Mining

Capstone

  • Project Time Series Analysis
  • Hands-on Exercise – Analyzing time series data, the sequence of measurements that follow a non-random order to recognize the nature of the phenomenon, and forecasting the future values in the series.

Tableau

  • Tableau for data visualisation:
  • Tableau Introduction
  • Working on data with Tableau
  • Dashboards using Tableau (hands-on)
  • Stories in Tableau (hands-on)

Ms.Abidunnisa Begum

The trainer has Industry & teaching experience in R, Python, Tableau, Data Science, and Machine Learning.

Trainer's Technical Summary:

  • Technical expertise in design, development, integration, and up-gradation of applications.
  • Good understanding of aof numbers, statistics, mathematical concepts, and algorithms.
  • Experience and understanding of Machine Learning, e.g.: linear/logistic regression, random forest, SVM, etc – Using R Studio.
  • Machine Learning and Text Analytics using R Studio.
  • Knowledge of HTML Component Development, Master, and detail view development.
  • Worked on Custom requirements & Extensibility for HTML.
  • Knowledge of using Tableau to analyze and visualize data.
  • Connect Tableau to a variety of data sets
  • Analyze, blend join, and calculate data.
  • Visualize data in the form of various charts, plots, and Maps.
  • Convert raw data into compelling Data Visualization with Tableau.

No reviews found

Batch Start Date End Date Timings Batch Type
No video found

Description

Data Science is a dynamic field of Statistics and Computer Science. The emergence of large datasets with billions of observations makes primary impetus for the field. Such datasets come in astronomy, large-scale retailing, telecommunications, and social media. This course is designed to emphasize on practical techniques of working with large data. Topics covered include statistical modelling, data pipelines, machine learning, programming languages, real-world topics, big data tools, and case studies.

This is an instructor-led course with an average batch size of 4 students. In the 60 hours of Online Live training, you will get both the theoretical and practical knowledge needed to build the necessary skills. The trainer’s holistic approach is stemmed to meet the long-term needs of the student and hence they provide 100% job/placement assistance with the option of seeking a trial class before the enrolment.

What Will I Learn?

  • You will learn Statistics, Probability theory and Hypothesis testing,
  • You will learn programming languages like Python and R programming.
  • You will learn machine learning and deep learning concepts.

Specifications

  • Free Demo
  • Learn from Experts
  • Interactive Learning
  • Missed Class Recovery
  • Instalment Facility

No Comments

Please login to leave a review

Related Classes